Abstract

In this paper, we develop an approach for optimizing the explicit binomial confidence
interval recently derived by Chen et al. The optimization reduces conservativeness while
guaranteeing prescribed coverage probability.

1 Explicit Formula of Chen et al.

Let $X$ be a Bernoulli random variable defined in probability space $(\Omega, \mathscr{F}, \Pr)$ with distribution
$\Pr\{X = 1\} = 1 - \Pr\{X = 0\} = p \in (0, 1)$. It is a frequent problem to construct a confidence
interval for $p$ based on $n$ i.i.d. random samples $X_1, \cdots, X_n$ of $X$.

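Recently, Chen et al. have proposed an explicit confidence interval in [2] with lower confidence
limit
\[ L_{n,\delta} = \frac{K}{n} + \frac{3}{4} \, \frac{1 - \frac{2K}{n} - \sqrt{1 + 4 \lambda \frac{K}{n} \left( 1 - \frac{K}{n} \right)}}{1 + \lambda} \]
and upper confidence limit
\[ U_{n,\delta} = \frac{K}{n} + \frac{3}{4} \, \frac{1 - \frac{2K}{n} + \sqrt{1 + 4 \lambda \frac{K}{n} \left( 1 - \frac{K}{n} \right)}}{1 + \lambda} \]
where $K = \sum_{i=1}^{n} X_i$ and $\lambda = \frac{9n}{8 \ln (2/\delta)}$. Such confidence interval guarantees that the coverage probability
$\Pr\{L_{n,\delta} < p < U_{n,\delta} \mid p\}$ is greater than $1 - \delta$ for any $p \in (0, 1)$.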

Clearly, the explicit binomial confidence interval is conservative and it is desirable to optimize
the confidence interval by tuning the parameter $\delta$. This is the objective of the next section.
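To make the construction concrete, the following Python sketch evaluates both confidence limits for an observed count. It is only an illustration under stated assumptions: the limits are taken in the form $z + \frac{3}{4} \frac{1 - 2z \mp \sqrt{1 + 4 \lambda z (1 - z)}}{1 + \lambda}$ with $z = K/n$, in agreement with the functions $L(z)$ and $U(z)$ used in Section 3, and $\lambda = 9n / (8 \ln(2/\delta))$ is assumed. The function name `explicit_limits` is hypothetical.

```python
import math


def explicit_limits(k, n, delta):
    """Return (lower, upper) confidence limits for k successes in n trials.

    Assumption (not a library API): lambda = 9n / (8 ln(2/delta)), z = k/n.
    """
    lam = 9.0 * n / (8.0 * math.log(2.0 / delta))
    z = k / n
    root = math.sqrt(1.0 + 4.0 * lam * z * (1.0 - z))
    lower = z + 0.75 * (1.0 - 2.0 * z - root) / (1.0 + lam)
    upper = z + 0.75 * (1.0 - 2.0 * z + root) / (1.0 + lam)
    return lower, upper
```

For instance, `explicit_limits(3, 10, 0.05)` returns a pair that brackets the sample mean 0.3, and the limits for $k$ and $n - k$ mirror each other around 1/2.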
2 Optimization of Explicit Binomial Confidence Interval

As will be seen in Section 3, it can be shown that

Theorem 1 For any fixed $n$ and $p \in (0, 1)$, the coverage probability of confidence interval $[L_{n,\delta}, U_{n,\delta}]$
decreases as $\delta$ increases.



Hence, it is possible to find $\delta > \alpha$ such that

\[ \Pr\{L_{n,\delta} < p < U_{n,\delta} \mid p\} > 1 - \alpha, \quad \forall p \in (0, 1) \]

for $\alpha \in (0, 1)$. To reduce conservatism of the confidence interval, we consider the following
optimization problem:

For a given $\alpha \in (0, 1)$, maximize $\delta$ subject to the constraint that

\[ \inf_{p \in (0,1)} \Pr\{L_{n,\delta} \le p \le U_{n,\delta} \mid p\} \ge 1 - \alpha. \]

A similar problem is to maximize $\delta$ subject to the constraint that

\[ \inf_{p \in (0,1)} \Pr\{L_{n,\delta} < p < U_{n,\delta} \mid p\} \ge 1 - \alpha. \]

As a result of Theorem 1, the maximum $\delta$ can be obtained from $(\alpha, 1)$ by a bisection search.
In this regard, it is essential to efficiently evaluate $\inf_{p \in (0,1)} \Pr\{L_{n,\delta} \le p \le U_{n,\delta} \mid p\}$ and
$\inf_{p \in (0,1)} \Pr\{L_{n,\delta} < p < U_{n,\delta} \mid p\}$. This is accomplished by the following theorem derived from
the theory of random intervals established in [3].
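The bisection search can be sketched in Python as follows. This is an illustration only: the infimum over $p$ is replaced by a minimum over a finite grid of $p$ values, which is a surrogate and not the exact evaluation provided by Theorem 2, and the limits are computed assuming $\lambda = 9n / (8 \ln(2/\delta))$. The names `explicit_limits`, `grid_min_coverage` and `max_delta` are hypothetical.

```python
import math


def explicit_limits(k, n, delta):
    """(lower, upper) limits; assumes lambda = 9n / (8 ln(2/delta))."""
    lam = 9.0 * n / (8.0 * math.log(2.0 / delta))
    z = k / n
    root = math.sqrt(1.0 + 4.0 * lam * z * (1.0 - z))
    return (z + 0.75 * (1.0 - 2.0 * z - root) / (1.0 + lam),
            z + 0.75 * (1.0 - 2.0 * z + root) / (1.0 + lam))


def grid_min_coverage(n, delta, grid=200):
    """Minimum of Pr{L <= p <= U | p} over the grid p = 1/grid, ..., (grid-1)/grid."""
    lims = [explicit_limits(k, n, delta) for k in range(n + 1)]
    worst = 1.0
    for i in range(1, grid):
        p = i / grid
        cover = sum(math.comb(n, k) * p**k * (1.0 - p)**(n - k)
                    for k, (lo, up) in enumerate(lims) if lo <= p <= up)
        worst = min(worst, cover)
    return worst


def max_delta(n, alpha, coverage=grid_min_coverage, iters=40):
    """Bisection on (alpha, 1) for the largest delta with coverage >= 1 - alpha.

    Relies on the coverage being non-increasing in delta (Theorem 1) and on
    delta = alpha being feasible (the guarantee of the explicit interval).
    """
    lo, hi = alpha, 1.0 - 1e-9
    if coverage(n, hi) >= 1.0 - alpha:
        return hi
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if coverage(n, mid) >= 1.0 - alpha:
            lo = mid  # mid is feasible, move the lower end up
        else:
            hi = mid  # mid violates the constraint, move the upper end down
    return lo
```

Passing a different `coverage` callable (for example an exact evaluation of the infimum) leaves the search itself unchanged, since only the monotonicity in $\delta$ is used.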

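Theorem 2 Let $\delta \in (0, 1)$. Define

for $k = 0, 1, \cdots, n$. Define

\[ C_l(k) = \Pr\{ \lceil T^-(\mathscr{L}(k)) \rceil \le K \le k - 1 \mid \mathscr{L}(k) \}, \qquad C_l'(k) = \Pr\{ \lfloor T^-(\mathscr{L}(k)) \rfloor + 1 \le K \le k - 1 \mid \mathscr{L}(k) \} \]

for $k \in \{0, 1, \cdots, n\}$ such that $0 < \mathscr{L}(k) < 1$. Define

\[ C_u(k) = \Pr\{ k + 1 \le K \le \lfloor T^+(\mathscr{U}(k)) \rfloor \mid \mathscr{U}(k) \}, \qquad C_u'(k) = \Pr\{ k + 1 \le K \le \lceil T^+(\mathscr{U}(k)) \rceil - 1 \mid \mathscr{U}(k) \} \]

for $k \in \{0, 1, \cdots, n\}$ such that $0 < \mathscr{U}(k) < 1$.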



Then, the following statements hold true: 

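(I): $\inf_{p \in (0,1)} \Pr\{L_{n,\delta} \le p \le U_{n,\delta} \mid p\}$ equals the minimum of

\[ \{C_l(k) : 0 \le k \le n; \; 0 < \mathscr{L}(k) < 1\} \cup \{C_u(k) : 0 \le k \le n; \; 0 < \mathscr{U}(k) < 1\}. \]

(II): $\inf_{p \in (0,1)} \Pr\{L_{n,\delta} < p < U_{n,\delta} \mid p\}$ equals the minimum of

\[ \{C_l'(k) : 0 \le k \le n; \; 0 < \mathscr{L}(k) < 1\} \cup \{C_u'(k) : 0 \le k \le n; \; 0 < \mathscr{U}(k) < 1\}. \]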

The proof of Theorem 2 is provided in Section 4. 
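Statement (I) reduces the infimum to finitely many binomial probabilities evaluated at the confidence-limit values. The Python sketch below illustrates this idea by brute force: instead of the quantities $T^-$ and $T^+$ of the theorem, it compares the limits directly, counting at $p$ equal to a lower limit those outcomes $j$ whose lower limit is strictly below $p$ and whose upper limit is at least $p$, and at $p$ equal to an upper limit those outcomes whose lower limit is at most $p$ and whose upper limit is strictly above $p$. The limits are computed assuming $\lambda = 9n / (8 \ln(2/\delta))$, and all function names are hypothetical.

```python
import math


def explicit_limits(k, n, delta):
    """(lower, upper) limits; assumes lambda = 9n / (8 ln(2/delta))."""
    lam = 9.0 * n / (8.0 * math.log(2.0 / delta))
    z = k / n
    root = math.sqrt(1.0 + 4.0 * lam * z * (1.0 - z))
    return (z + 0.75 * (1.0 - 2.0 * z - root) / (1.0 + lam),
            z + 0.75 * (1.0 - 2.0 * z + root) / (1.0 + lam))


def binom_pmf(j, n, p):
    """Pr{K = j} for K ~ Binomial(n, p)."""
    return math.comb(n, j) * p**j * (1.0 - p)**(n - j)


def inf_coverage_closed(n, delta):
    """Smallest one-sided limit of Pr{L <= p <= U | p} over all limit values in (0, 1)."""
    lims = [explicit_limits(k, n, delta) for k in range(n + 1)]
    worst = 1.0
    for lo_k, up_k in lims:
        if 0.0 < lo_k < 1.0:
            # p approaches a lower limit from below: that outcome drops out
            p = lo_k
            cover = sum(binom_pmf(j, n, p)
                        for j, (lo, up) in enumerate(lims) if lo < p <= up)
            worst = min(worst, cover)
        if 0.0 < up_k < 1.0:
            # p approaches an upper limit from above: that outcome drops out
            p = up_k
            cover = sum(binom_pmf(j, n, p)
                        for j, (lo, up) in enumerate(lims) if lo <= p < up)
            worst = min(worst, cover)
    return worst
```

The cost is of order $n^2$ probability evaluations per value of $\delta$, which keeps the bisection search over $\delta$ inexpensive for moderate $n$.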

3 Proof of Theorem 1 
For simplicity of notations, define $\lambda = \frac{9n}{8 \ln (2/\delta)}$ and $z = \frac{k}{n}$. Then, for $K = k$, the upper and lower
confidence limits are $U_{n,\delta} = U(z)$ and $L_{n,\delta} = L(z)$ respectively, where

\[ U(z) = z + \frac{3}{4} \, \frac{1 - 2z + \sqrt{1 + 4 \lambda z (1 - z)}}{1 + \lambda}, \qquad L(z) = z + \frac{3}{4} \, \frac{1 - 2z - \sqrt{1 + 4 \lambda z (1 - z)}}{1 + \lambda}. \]

Since $(1 - 2p)^2 < 1 + 4 \lambda p (1 - p)$ for $p \in (0, 1)$ and $\lambda > 0$, we have $L(z) < z$ and $U(z) > z$. Hence,
to show Theorem 1, it suffices to show that both $U(z) - z$ and $z - L(z)$ decrease as $\delta$ increases for
any $z \in [0, 1]$. We shall first show that $U(z) - z$ decreases as $\delta$ increases for any fixed $z \in [0, 1]$.

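For this purpose, we can define

\[ y = \frac{4 [U(z) - z]}{3} \]

and show that $\frac{\partial y}{\partial \lambda} < 0$. To this end, we can use the definition of $y$ to obtain the following equation
$[(1 + \lambda) y - (1 - 2z)]^2 = 1 + 4 \lambda z (1 - z)$. Differentiating both sides of this equation with respect
to $\lambda$ yields

\[ 2 [(1 + \lambda) y - (1 - 2z)] \left[ y + (1 + \lambda) \frac{\partial y}{\partial \lambda} \right] = 4 z (1 - z), \]

from which we have

\[ (1 + \lambda) \frac{\partial y}{\partial \lambda} = \frac{2 z (1 - z)}{(1 + \lambda) y - (1 - 2z)} - y. \]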

Clearly, to show $\frac{\partial y}{\partial \lambda} < 0$, it suffices to show that the right-hand side of the above equation is
negative for any $z \in [0, 1]$ and $\lambda > 0$. That is, to show

\[ \frac{2 z (1 - z)}{\sqrt{1 + 4 \lambda z (1 - z)}} < y, \]

or equivalently,

\[ (1 + \lambda) \, 2 z (1 - z) < (1 - 2z) \sqrt{1 + 4 \lambda z (1 - z)} + 1 + 4 \lambda z (1 - z), \]



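which can be written as $2 z (1 - z) < w(\lambda)$, where

\[ w(\lambda) = (1 - 2z) \sqrt{1 + 4 \lambda z (1 - z)} + 1 + 2 \lambda z (1 - z). \]

Note that

\[ \frac{d w(\lambda)}{d \lambda} = 2 z (1 - z) \left[ 1 + \frac{1 - 2z}{\sqrt{1 + 4 \lambda z (1 - z)}} \right] \ge 0 \]

as a result of

\[ \frac{1 - 2z}{\sqrt{1 + 4 \lambda z (1 - z)}} \ge - \frac{|1 - 2z|}{\sqrt{1 + 4 \lambda z (1 - z)}} \ge -1, \]

where the inequalities are strict for $z \in (0, 1)$.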

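Hence, $w(\lambda) \ge w(0) = 2(1 - z) \ge 2 z (1 - z)$ for any $\lambda > 0$, and at least one of these two inequalities is
strict when $z \in [0, 1)$. This shows that $2 z (1 - z) < w(\lambda)$ for any $z \in [0, 1)$ and $\lambda > 0$. Consequently, we
have established $\frac{\partial y}{\partial \lambda} < 0$ for $z \in [0, 1)$, while $U(z) - z = 0$ for every $\lambda$ when $z = 1$. Since $\lambda$
increases with $\delta$, this implies that $U(z) - z$ decreases as $\delta$ increases for any fixed $z \in [0, 1]$ (trivially
so at $z = 1$). Observing that $L(z) = 1 - U(1 - z)$ for any $z \in [0, 1]$, we have

\[ z - L(z) = z - [1 - U(1 - z)] = U(1 - z) - (1 - z). \]

Therefore, it must be true that $z - L(z)$ decreases as $\delta$ increases for any fixed $z \in [0, 1]$. So, the
proof of Theorem 1 is completed.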
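The monotonicity just established can be checked numerically. The Python sketch below is a sanity check, not part of the proof: it evaluates $U(z) - z$ and $z - L(z)$ on a grid of $z$ for an increasing sequence of $\delta$ and confirms that neither quantity grows. It assumes $\lambda = 9n / (8 \ln(2/\delta))$; the function names are hypothetical.

```python
import math


def gaps(z, n, delta):
    """Return (U(z) - z, z - L(z)); assumes lambda = 9n / (8 ln(2/delta))."""
    lam = 9.0 * n / (8.0 * math.log(2.0 / delta))
    root = math.sqrt(1.0 + 4.0 * lam * z * (1.0 - z))
    upper_gap = 0.75 * (1.0 - 2.0 * z + root) / (1.0 + lam)
    lower_gap = 0.75 * (root - (1.0 - 2.0 * z)) / (1.0 + lam)
    return upper_gap, lower_gap


def gaps_shrink(n, deltas, steps=50, tol=1e-12):
    """True if both gaps are non-increasing in delta at every grid point z."""
    for i in range(steps + 1):
        z = i / steps
        previous = None
        for delta in sorted(deltas):
            current = gaps(z, n, delta)
            if previous is not None and (current[0] > previous[0] + tol
                                         or current[1] > previous[1] + tol):
                return False
            previous = current
    return True
```

A call such as `gaps_shrink(20, [0.01, 0.05, 0.1, 0.5, 0.9])` exercises the boundary points $z = 0$ and $z = 1$ as well, where one of the two gaps is identically zero.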



4 Proof of Theorem 2 

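For simplicity of notations, we define $\lambda$, $z$, $L(z)$ and $U(z)$ as in the proof of Theorem 1. Note
that $L(1) = 1 - \frac{3}{2(1 + \lambda)} < 1$ and the derivative of $L(z)$ with respect to $z$ is

\[ \frac{\partial L(z)}{\partial z} = 1 - \frac{3}{2 (1 + \lambda)} \left[ 1 + \frac{\lambda (1 - 2z)}{\sqrt{1 + 4 \lambda z (1 - z)}} \right], \]

which is positive if and only if $(2 \lambda - 1) \sqrt{1 + 4 \lambda z (1 - z)} > 3 \lambda (1 - 2z)$.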

To complete the proof of Theorem 2, we need some preliminary results. 

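Lemma 1 For any $n \ge 1$ and $\delta \in (0, 1)$,

\[ \frac{(2 \lambda - 1)^2 (1 + \lambda)}{36 \lambda^2 + 4 \lambda (2 \lambda - 1)^2} > \frac{1}{4} \qquad (3) \]

if and only if $\lambda < \frac{1}{5}$.

Proof. Note that (3) is equivalent to $(2 \lambda - 1)^2 (1 + \lambda) > 9 \lambda^2 + \lambda (2 \lambda - 1)^2$, which can be simplified
as $(5 \lambda - 1)(\lambda + 1) < 0$. Since $\delta \in (0, 1)$ and $n \ge 1$, we have $\lambda > 0$. Hence, the inequality (3) holds
if and only if $\lambda < \frac{1}{5}$.

□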
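The equivalence in Lemma 1 can be confirmed with exact rational arithmetic. The Python sketch below is a check on sample values rather than a proof; it uses `fractions.Fraction` so that the boundary value $\lambda = 1/5$, where the left-hand side of (3) equals exactly $1/4$, is handled without rounding. The function names are hypothetical.

```python
from fractions import Fraction


def lemma1_lhs(lam):
    """Left-hand side of inequality (3) for a positive rational lambda."""
    return ((2 * lam - 1) ** 2 * (1 + lam)
            / (36 * lam ** 2 + 4 * lam * (2 * lam - 1) ** 2))


def lemma1_agrees(lam):
    """True if 'lhs > 1/4' and 'lambda < 1/5' have the same truth value."""
    return (lemma1_lhs(lam) > Fraction(1, 4)) == (lam < Fraction(1, 5))
```

Evaluating `lemma1_agrees(Fraction(i, 100))` over a range of positive integers `i` covers values on both sides of the threshold.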



Lemma 2 $L(z)$ is monotonically increasing with respect to $z \in [0, 1]$ such that $L(z) > 0$. Similarly,
$U(z)$ is monotonically increasing with respect to $z \in [0, 1]$ such that $U(z) < 1$.

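Proof. We shall first show that $L(z)$ is monotonically increasing with respect to $z \in [0, 1]$ such
that $L(z) > 0$. It suffices to consider four cases:

Case (i): $\lambda > \frac{1}{2}$ and $0 \le z \le \frac{1}{2}$;

Case (ii): $\lambda > \frac{1}{2}$ and $1 \ge z > \frac{1}{2}$;

Case (iii): $\lambda \le \frac{1}{2}$ and $0 \le z \le \frac{1}{2}$;

Case (iv): $\lambda \le \frac{1}{2}$ and $1 \ge z > \frac{1}{2}$.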

In Case (i), $L(z)$ increases if and only if $(2 \lambda - 1)^2 [1 + 4 \lambda z (1 - z)] > 9 \lambda^2 (1 - 2z)^2$, or equivalently,

\[ z > \frac{1}{2} - \sqrt{\frac{(2 \lambda - 1)^2 (1 + \lambda)}{36 \lambda^2 + 4 \lambda (2 \lambda - 1)^2}}. \]

Define

\[ z^* = \frac{1}{2} - \sqrt{\frac{(2 \lambda - 1)^2 (1 + \lambda)}{36 \lambda^2 + 4 \lambda (2 \lambda - 1)^2}}. \]

By Lemma 1, we have $z^* > 0$. It follows that $L(z)$ is monotonically decreasing with respect
to $z \in (0, z^*)$ and monotonically increasing with respect to $z \in (z^*, \frac{1}{2})$. This implies that $L(z)$
achieves its minimum at $z^*$ and $L(z) < L(0) = 0$ for any $z \in (0, z^*)$. Therefore, we have shown
that $L(z)$ is monotonically increasing with respect to $z \in (0, 1)$ such that $L(z) > 0$ and that the
conditions of Case (i) hold true.

In Case (ii), $L(z)$ increases for $z \in (\frac{1}{2}, 1)$.

In Case (iii), $L(z)$ decreases for $z \in (0, \frac{1}{2}]$. It can be seen that $L(z) < L(0) = 0$ for any
$z \in (0, \frac{1}{2}]$.



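In Case (iv), $L(z)$ increases if and only if $(2 \lambda - 1) \sqrt{1 + 4 \lambda z (1 - z)} > 3 \lambda (1 - 2z)$, which can
be written as $(1 - 2 \lambda) \sqrt{1 + 4 \lambda z (1 - z)} < 3 \lambda (2z - 1)$ or equivalently,

\[ z - \frac{1}{2} > \sqrt{\frac{(2 \lambda - 1)^2 (1 + \lambda)}{36 \lambda^2 + 4 \lambda (2 \lambda - 1)^2}}. \]

In this case, define

\[ z^* = \frac{1}{2} + \sqrt{\frac{(2 \lambda - 1)^2 (1 + \lambda)}{36 \lambda^2 + 4 \lambda (2 \lambda - 1)^2}}. \]

If $\frac{1}{2} \ge \lambda > \frac{1}{5}$, by Lemma 1 we have $z^* < 1$. Hence, $L(z)$ increases for $z \in (z^*, 1)$ and

\[ L(z) = z + \frac{3}{4} \, \frac{1 - 2z - \sqrt{1 + 4 \lambda z (1 - z)}}{1 + \lambda} < L(1) = \frac{2 \lambda - 1}{2 (1 + \lambda)} \le 0, \quad \forall z \in (z^*, 1). \]

Moreover, $L(z)$ decreases for $z \in (\frac{1}{2}, z^*)$ and

\[ L(z) = z + \frac{3}{4} \, \frac{1 - 2z - \sqrt{1 + 4 \lambda z (1 - z)}}{1 + \lambda} < L\left(\frac{1}{2}\right) = \frac{1}{2} - \frac{3}{4 \sqrt{1 + \lambda}} < 0, \quad \forall z \in \left(\frac{1}{2}, z^*\right). \]

If $0 < \lambda \le \frac{1}{5}$, by Lemma 1 we have $z^* \ge 1$. Hence, $L(z)$ decreases for $z \in (\frac{1}{2}, 1)$ and

\[ L(z) < L\left(\frac{1}{2}\right) < 0, \quad \forall z \in \left(\frac{1}{2}, 1\right]. \]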

Based on the preceding investigation, we can conclude that the lower confidence limit is non-decreasing
with respect to $z \in (0, 1)$ such that $L(z) > 0$. Recalling that $L(1) < 1$, we have that
$L(z) < 1$ for any $z \in (0, 1)$.

Since $U(z) = 1 - L(1 - z) > 0$ for any $z \in (0, 1)$, we have that the upper confidence limit $U(z)$
is also non-decreasing with respect to $z \in (0, 1)$ such that $U(z) < 1$.

□
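The conclusion of Lemma 2 for the lower limit can be illustrated numerically. The Python sketch below is a spot check on a grid, not a proof: for a given $\lambda$ it scans $z$ from 0 to 1 and verifies that, once $L(z)$ becomes positive, it stays positive and never decreases. The form of $L(z)$ is the one from the proof of Theorem 1; the function names are hypothetical.

```python
import math


def lower_limit(z, lam):
    """L(z) = z + (3/4) (1 - 2z - sqrt(1 + 4 lam z (1 - z))) / (1 + lam)."""
    root = math.sqrt(1.0 + 4.0 * lam * z * (1.0 - z))
    return z + 0.75 * (1.0 - 2.0 * z - root) / (1.0 + lam)


def increasing_where_positive(lam, steps=1000):
    """True if the grid values with L(z) > 0 form a non-decreasing upper run."""
    previous = None
    for i in range(steps + 1):
        value = lower_limit(i / steps, lam)
        if value > 0.0:
            if previous is not None and value < previous:
                return False  # a positive value dropped below its predecessor
            previous = value
        elif previous is not None:
            return False  # L(z) fell back to <= 0 after being positive
    return True
```

For $\lambda \le \frac{1}{2}$ the check passes vacuously, because $L(z) \le 0$ on the whole grid, in line with Cases (iii) and (iv).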



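Now we consider the minimum coverage probability. By the definitions of $L_{n,\delta}$, $U_{n,\delta}$ and
$\mathscr{L}(k)$, $\mathscr{U}(k)$, we have

\[ \Pr\{L_{n,\delta} \le p \le U_{n,\delta} \mid \mathscr{U}(k)\} = \Pr\{k \le K \le T^+(p) \mid \mathscr{U}(k)\}, \quad 0 < \mathscr{U}(k) < 1 \]

\[ \Pr\{L_{n,\delta} \le p \le U_{n,\delta} \mid \mathscr{L}(k)\} = \Pr\{T^-(p) \le K \le k \mid \mathscr{L}(k)\}, \quad 0 < \mathscr{L}(k) < 1 \]

\[ \Pr\{L_{n,\delta} < p < U_{n,\delta} \mid \mathscr{U}(k)\} = \Pr\{k < K < T^+(p) \mid \mathscr{U}(k)\}, \quad 0 < \mathscr{U}(k) < 1 \]

\[ \Pr\{L_{n,\delta} < p < U_{n,\delta} \mid \mathscr{L}(k)\} = \Pr\{T^-(p) < K < k \mid \mathscr{L}(k)\}, \quad 0 < \mathscr{L}(k) < 1. \]

Since both $L(z)$ and $U(z)$ are monotone, the proof of Theorem 2 can be completed by making
use of the above results and applying the theory of coverage probability of random intervals
established by Chen in [3].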